Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation

Authors

  • Guanghan Ning
  • Zhi Zhang
  • Zhihai He
Abstract

Human pose estimation using deep neural networks aims to map input images with large variations into multiple body keypoints, which must satisfy a set of geometric constraints and inter-dependencies imposed by the human body model. This is a very challenging nonlinear manifold learning process in a very high-dimensional feature space. We believe that the deep neural network, which is inherently an algebraic computation system, is not the most efficient way to capture highly sophisticated human knowledge, such as the highly coupled geometric characteristics and interdependence between keypoints in human poses. In this work, we propose to explore how external knowledge can be effectively represented and injected into deep neural networks to guide their training process, using learned projections that impose proper priors. Specifically, we use the stacked hourglass design and the inception-resnet module to construct a fractal network that regresses human pose images into heatmaps with no explicit graphical modeling. We encode external knowledge with visual features that are able to characterize the constraints of human body models and evaluate the fitness of intermediate network output. We then inject these external features into the neural network using a projection matrix learned with an auxiliary cost function. The effectiveness of the proposed inception-resnet module and the benefit of guided learning with knowledge projection are evaluated on two widely used human pose estimation benchmarks. Our approach achieves state-of-the-art performance on both datasets.
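
The abstract says the network regresses images into heatmaps but does not say how keypoint coordinates are read from them. A common convention, assumed here and not confirmed by the abstract, is to take the highest-scoring cell of each per-keypoint heatmap. The sketch below illustrates only that decoding step; the function name `decode_keypoints` and the toy values are hypothetical.

```python
from typing import List, Tuple

Heatmap = List[List[float]]  # rows x cols grid of per-pixel scores


def decode_keypoints(heatmaps: List[Heatmap]) -> List[Tuple[int, int, float]]:
    """Return (x, y, score) of the highest-scoring cell in each heatmap.

    Assumption: one heatmap per body keypoint, decoded by a plain argmax.
    """
    keypoints = []
    for hm in heatmaps:
        best = (0, 0, float("-inf"))
        for y, row in enumerate(hm):
            for x, score in enumerate(row):
                if score > best[2]:
                    best = (x, y, score)
        keypoints.append(best)
    return keypoints


# Two toy 3x4 heatmaps (hypothetical values), one per keypoint.
toy = [
    [[0.0, 0.1, 0.0, 0.0],
     [0.0, 0.9, 0.2, 0.0],
     [0.0, 0.1, 0.0, 0.0]],
    [[0.0, 0.0, 0.0, 0.3],
     [0.0, 0.0, 0.0, 0.1],
     [0.0, 0.0, 0.0, 0.8]],
]
print(decode_keypoints(toy))  # [(1, 1, 0.9), (3, 2, 0.8)]
```

Because each keypoint is decoded independently in this sketch, nothing enforces the geometric constraints between keypoints that the abstract describes; that is the gap the proposed knowledge projection is meant to address during training.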

Similar resources

An adaptive estimation method to predict thermal comfort indices man using car classification neural deep belief

Many experimental and theoretical indices of human thermal comfort and discomfort are calculated from climatic input data such as wind speed, temperature, humidity, and solar radiation. Daily data on temperature, wind speed, relative humidity, and cloudiness for the years 1382-1392 were used. In the first step, the Tmrt parameter was calculated in the Ray...

Structured Prediction of 3D Human Pose with Deep Neural Networks

Most recent approaches to monocular 3D pose estimation rely on Deep Learning. They either train a Convolutional Neural Network to directly regress from image to 3D pose, which ignores the dependencies between human joints, or model these dependencies via a max-margin structured learning framework, which involves a high computational cost at inference time. In this paper, we introduce a Deep Lea...

Human Pose Estimation and Activity Classification Using Convolutional Neural Networks

In this paper, we investigate the problems of human pose estimation and activity classification using a deep learning approach. We constructed a CNN to address the regression problem of human joint location estimation, and achieved a PDJ score of about 60%. Furthermore, using weight initializations from an AlexNet trained to classify on ImageNet, we trained a deep convolutional neural network (...

Human Pose Estimation with CNNs and LSTMs

Human pose estimation from images and videos has been a very important research field in computer vision. In this thesis, we present an end-to-end approach to the human pose estimation task based on a deep hybrid architecture that combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs). CNNs are used to map the input image to a feature space (fixed dimensionality), and then...

Delineation of alteration zones based on artificial neural networks and concentration-volume fractal methods in the hypogene zone of porphyry copper-gold deposit, Masjed-Daghi, East Azerbaijan Province, Iran

In this paper, we aim to achieve two specific objectives. The first one is to examine the applicability of the Artificial Neural Networks (ANNs) technique in ore grade estimation. Different training algorithms and numbers of hidden neurons are applied to estimate Cu grade of borehole data in the hypogene zone of porphyry copper-gold deposit, Masjed-Daghi, East Azerbaijan Province (Iran). The ef...

Journal:
  • CoRR

Volume abs/1705.02407

Publication date 2017